Picture for Changshuo Wang

Changshuo Wang

Hierarchical Semantic-Augmented Navigation: Optimal Transport and Graph-Driven Reasoning for Vision-Language Navigation

Add code
Jun 01, 2026
Viaarxiv icon

CogniVerse: Revolutionizing Multi-Modal Retrieval-Augmented Generation with Cognitive Reflection and Geometric Reasoning

Add code
May 28, 2026
Viaarxiv icon

Rethinking Video-Language Model from the Language Input Perspective

Add code
May 27, 2026
Viaarxiv icon

Towards Unified Vision-Language Models with Incomplete Multi-Modal Inputs

Add code
May 27, 2026
Viaarxiv icon

Unveiling the Fragility of Vision-Language Models: Multi-Modal Adversarial Synergy via Texture-Constrained Perturbations and Cross-Modal Optimization

Add code
May 26, 2026
Viaarxiv icon

A Synonymous Variational Perspective on the Rate-Distortion-Perception Tradeoff

Add code
Apr 16, 2026
Viaarxiv icon

DiffStyle3D: Consistent 3D Gaussian Stylization via Attention Optimization

Add code
Jan 27, 2026
Viaarxiv icon

MMPG: MoE-based Adaptive Multi-Perspective Graph Fusion for Protein Representation Learning

Add code
Jan 15, 2026
Viaarxiv icon

SplitFlux: Learning to Decouple Content and Style from a Single Image

Add code
Nov 19, 2025
Figure 1 for SplitFlux: Learning to Decouple Content and Style from a Single Image
Figure 2 for SplitFlux: Learning to Decouple Content and Style from a Single Image
Figure 3 for SplitFlux: Learning to Decouple Content and Style from a Single Image
Figure 4 for SplitFlux: Learning to Decouple Content and Style from a Single Image
Viaarxiv icon

FantasyStyle: Controllable Stylized Distillation for 3D Gaussian Splatting

Add code
Aug 11, 2025
Viaarxiv icon